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Abstract. We characterize normalization by evaluation as the compositi on of a self - interp reter 
with a self-reducer using a special representation scheme, in the sense of Mogensen] ll992l) . We 
do so by deriving in a systematic way an untyped normalization by evaluation algorithm from 
a standard interpreter for the A-calculus. The derived algorithm is not novel and indeed other 
published algorithms may be obtained in the same manner through appropriate adaptations to 
the representation scheme. 

1. Self-interpreters and self-reducers 

What is a self-interpreter? For the untyped A-calculus . IMogensenI (|l992|) offers the following 
definition, given an injective mapping r - n (the representation scheme) that yields representations of 
arbitrary terms: 

E V W = p M. 

That is, a self -interpreter is a term E of the A-calculus such that when applied to the represen- 
tation r M n of any term M, the result is a convertible term (modulo renaming). 

The mapping cannot of course be defined within the A-calculus itself, but we posit its 
existence as a primitive operation of the calculus. The representation of a term is a piece of 
data, something that can be manipulated, transformed and inspected within the calculus itself. 
It is natural to represent data as terms in normal form, so that data may be regarded as constant 
with regard to term reduction. Consider the following grammar for terms and normal terms: 

Var a x, y, z 



Term 3 t 
Term id TerrriNF 3 t„ 
Term 3 TerrriA 3 t„ 



= x | Ax.t | 1 1 

= t„ | Ax.tn 
— X \t a t n 



The representation scheme can be typed as r - n : Term — > TerrriNF. 

All manner of representation schema are possible, but Mosensen commits to a particu- 
larly simple representation scheme, one that enables him to implement a trivially simple self- 
interpreter that not only yields convertible terms from their representations, but in fact whose 
weak head normal form when applied to a normal term M is identical to M, up to renaming of 
variables. Let us call this particular self- interpreter E a . We have that 

E a V NT -^ whnf M. 



Mogensen goes on to define a self -reducer as a transformation on representations: 

R r NV =p r NF M \ 

where NF^ stands for the normal form of M, if one exists. Equipped with such a contraption, we 
can define a special kind of self -interpreter with the additional property that all representations 



^This is an analogue of the quote special form of Scheme. 
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of terms evaluate to normal forms. For all M, 

E NF r M'=E a (R r Af) -^ whnf NF M . 

After a small detour towards concrete implementations of the above, we will show how 
through successive transformations we may obtain a n untyped norma lization by evaluation 
algorithm, in fact precisely the algorithm presented in (jBoespflu gj, 2009). This algorithm is but 
one of several variants of untyped normalization by evaluation algorithms, an d indeed the 
transf orm ations presented here can r eadily be adapted to derive the algorithms of Aehli g et al 



(2008) and Filinski and Rohde (2004). Because the starting points of these transformations are 
in fact standard definitions of evaluators and normalizers, it is possible to obtain the correctness 
of the normalization by evaluation algorithm as a corollary of the correctness of the meaning 
preserving transformations we use. 

2. Implementation 

In the following, we assume a A-calculus with matching constructs, for convenience. Note 
however, that we could always reformulate the following in the pure A-calculus via a Church 
encoding. 

Let us recall the representation scheme given in (Mo gensenL ll992): 

V = Var x 
r Axf = Lam (Ax.T) 

% fr = A PP v v 

for distinguished constructors Var , Lam and App . Notice how the representation scheme uses 
higher order abstract syntax (HO AS). With this representation scheme, the definition of the E a 
self -interpreter above can be given as 

E a (Var x) = x 

E a (Lam f) = t 

E a (App f h) = (Ea t Q ) (E a h) 

Given a datatype for terms, and assuming the metalanguage follows a call-by-value eval- 
uation strategy, an evaluator yielding weak head normal forms using a call-by-name strategy 
might typically be defined as follows: 

eval (Var x) = Var x 

eval (Lam t) = Lam t 

eval (App t h) = case eval t Q of 

Lam t — > eval (t fi) 

t' Q -> App t' Q (eval h) 

Now a single change to the above gives us the definition of a normalizer: 

norm (Var x) = Var x 

norm (Lam t) = Lam (Ax.norm (t x)) 

norm (App to h) = case norm t of 

Lam t — > tt\ 

t' —> App t' (norm h) 

Whereas the evaluator does not traverse the boundary of a binder to fetch redexes beneath it 
(an abstraction is a value), a normalizer on the other hand does, so we move the recursive call 
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to norm inside the abstraction. There are no more recursive calls in the definition of norm than 
there are in the definition of eval — one of them merely changed place. 

A slight modification gives a normalizer following the call-by-value strategy. Namely, we 
normalize the argument before applying any abstraction to it. 

norm (Var x) = Var x 
norm (Lam f) = Lam (Ax.norm (fx)) 
norm (App f *i) = case norm f of 
Lam t — > t (norm h) 

t' —> App t' Q (norm t{) 

Notice that the call-by-value and call-by-name normalizers just defined both fit the bill as self- 
reducers: they take representations of terms to representations in normal form. That is, if one is 
willing to forgo the incompleteness of the call-by-value normalizer: there are terms for which 
there exists a normal form that may never be reached using a call-by-value strategy. 

The need to define two separate normalizers for two different normalization strategies is 
rather unfortunate. But if we restrict the input terms to a certain shape, namely those terms in 
continuation passing style (CPS), then there is only one normalization strategy possible, because 
at any given step in the computation, the only possible redex outside of any abstraction is to 
be found at the head of the term, if any. Hence only one normalizer need be built. But more 
importantly, it will justify a change in the representation scheme allowing for a much more 
efficient self -reducer than is possible with the current representation scheme. 



3. Shifting representation scheme 

A CPS transformation takes terms in Term to terms in Termcps/ a language generated by the 
following grammar. 

Term 3 Termv 3 v ::=x\ Ax.t c 
Term 3 Termcps 3 t c ::= v \ v v 

By modifying the representation scheme to return representations in continuation passing style 
instead, we effectively encode the reduction strategy of the normalizer into the scheme itselfl 
But on the flip side, we get a simpler definition for the normalizer: 

norm (Var x) = Var x 
norm (Lam t) = Lam (Ax.norm (t x)) 
norm (App (Lam f ) h) = norm (f £1) 
norm (App (Var x) t\) = App (Var x) (norm h) 

This definition is a mere specialization of the above definitions to terms in CPS, noting that 
such terms do not contain nested applications. We can break out the interpretation of ap- 
plication nodes to an auxiliary app function, using the fact that App (Var x) (norm ti) = 



Alternatively, we could leave the representation scheme untouched and have the CPS transform as a function from 
representations to representations, which we intercalate between quoting and normalizing. 
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norm (App (Var x) fi) for all x, t\\ 

norm (Var x) = Var x 
norm (Lam t) = Lam (Ax.norm (t x)) 
norm (App t t\) = norm (app t fj) 



app (Lam f ) h = k h 
app (Var x) t\ = App (Var x) h 



4. From here to there: normalization by evaluation 

Normalization by evaluation corresponds to interpreting terms as their representation yet 
identifying entire classes of terms with unique representations. In our setting, this is achieved by 
inlining the third clause of norm in place of every occurrence of an App node in the representation 
of the term to normalize. As an optimization, we can inline only the call to app, not the recursive 
call to norm, yielding the following final definition for norm, 

norm (Var x) = Var x 
norm (Lam t) = Lam (Ax.norm (t x)) 
norm (App t h) = App f (norm t{] 

where we use the fact that after inlining of app all App nodes that remain are applicative forms, 
i.e. bearing a variable as left branch. The associated interpretation function reads as follows: 

M = Var x 
fAx.q = Lam (Ax.p]) 
po hj = app po]] pj] 

This interpretation function can be read as a particular quote operation where terms are iden- 
tified modulo weak head normal forms. 
Normalization by evaluation is 

nbe t = E a (norm p]]) 



5. Conclusion 

We have gone from a self-reducer to an equivalent self -reducer on representations in CPS. By 
inlining part of the obtained self-reducer into the representation of terms, we further obtained 
a preexisting normalization by evaluation algorithm (which also works for terms in direct 
style). In short, for terms in CPS, untyped normalization by evaluation is the composition of a 
self -reducer with an appropriate quote operation. 

We have derived a more efficient self -reducer than the one presented in (;Moge nsen1.ll99^) . 
This derivation also serves as a road map for an alternative and simple proof of correctness of 
untyped normalization by evaluation for terms in CPS. 
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